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FAST SIMULATION OF NEW COINS FROM OLD 

By §erban Nacu and Yuval Peres 1 

University of California, Berkeley 

Let S C (0, 1). Given a known function /: S — » (0, 1), we consider 
the problem of using independent tosses of a coin with probability of 
heads p (where p G S is unknown) to simulate a coin with probability 
of heads f(p). We prove that if S is a closed interval and / is real 
analytic on S, then / has a fast simulation on S (the number of p-coin 
tosses needed has exponential tails). Conversely, if a function / has 
a fast simulation on an open set, then it is real analytic on that set. 

1. Introduction. We consider the problem of using a coin with proba- 
bility of heads p (p unknown) to simulate a coin with probability of heads 
f(p), where / is some known function. By this we mean the following: we 
are allowed to toss the original p-coin as many times as we want. We stop at 
some (almost surely) finite stopping time N, and depending on the outcomes 
of the first N tosses, we declare heads or tails. We want the probability of 
declaring a head to be exactly f(p). 

This problem goes back to von Neumann's 1951 article [13], where he 
describes an algorithm which simulates the constant function f{p) = 1/2. It 
is natural to ask whether this is possible for other functions, and in 1991 
Asmussen raised the question for the function f(p) = 2p, where it is known 
that p £ (0, 1/2) (see [8]). The same question was raised independently but 
later by Propp (see [10]). 

In 1994, Keane and O'Brien [8] obtained a necessary and sufficient con- 
dition for such a simulation to be possible. Consider f : S — > [0,1], where 
S C (0, 1). Then it is possible to simulate a coin with probability of heads 
f(p) for all p S S if and only if / is constant, or / is continuous and satisfies, 
for some n > 1, 
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(1) min(/(p), 1 — f(p)) > min(p, 1 — p) n VpGiS. 

In particular, f(p) = 2p cannot be simulated on (0,1/2), since the in- 
equality (1) cannot hold for p close to 1/2. However, if we are given e > 0, 
then an algorithm exists to simulate a 2p-coin from tosses of a p-coin for 
p€(0,l/2-e). 

The methods in [8] do not provide any estimates on the number N 
of p-coin tosses needed to simulate an /(p)-coin. The stopping time ./V will 
typically be unbounded, and for fast algorithms it should have rapidly decay- 
ing tails. For example, in von Neumann's algorithm [13], the tail probabilities 
satisfy P P (N > n) < (p 2 + (1 — p) 2 )L n / 2 J , so they decay exponentially in n. 

Definition 1 . A function / has a fast simulation on S if there exists an 
algorithm which simulates / on S, and for any p £ S there exist constants 
C > 0,p < 1 (which may depend on p) such that the number N of required 
inputs satisfies ~P p (N > n) <Cp n . 

Remark. If S is closed and / has a fast simulation on S, then we can 
choose constants C, p not depending onp€ S. See Proposition 21 for a proof. 

Theorem 1. For any e > 0, the function f{p) = 2p has a fast simulation 
on [0,1/2-e]. 

Building on this result, we prove: 

Theorem 2. Iff:I—* (0,1) is real analytic on the closed interval I C (0,1) , 
then it has a fast simulation on I . Conversely, if a function has a fast sim- 
ulation, then it is real analytic on any open subset of its domain. 

As the results stated above indicate, there is a correspondence between 
properties of simulation algorithms and classes of functions. Table 1 summa- 
rizes the results of [8, 10] and the present paper on this correspondence. For 
simplicity, in this table we restrict attention to functions / : S i— > T where 
S, T are closed intervals in (0,1). We do not know whether the one-sided 
arrows in the table can be reversed. 

We prove Theorem 1 in Sections 2 and 3. In Section 2 we show that simu- 
lating / is equivalent to finding sequences of certain Bernstein polynomials 
which approximate / from above and below. If the approximations are good, 
then the simulations are fast. In Section 3 we use this to construct a fast 
simulation for the function 2p. We can do this because the Bernstein polyno- 
mials provide exponentially convergent approximations for linear functions. 
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In Section 4 we prove the sufficient (constructive) part of Theorem 2. This 
is done in several steps. First, once we have a fast simulation for 2p, it is easy 
to construct fast simulations for polynomials. Using an auxiliary geometric 
random variable, we also obtain fast simulations for functions which have a 
series expansion around the origin. This proves Theorem 2 for real analytic 
functions that extend to an analytic function on a disk centered at the 
origin. For a general real analytic function, we use Mdbius maps of the form 
(az + b)/ [cz + d) to map a subset of their domain to the unit disk. Since we 
have fast simulations for Mdbius maps, this leads to fast simulations for the 
original function. 

In particular, Theorem 2 guarantees fast simulations for any rational 
function /, over any subset of (0, 1) where e < / < 1 — e. This general- 
izes a result from [10], where the authors prove that any rational function 
/ : (0, 1) — > (0, 1) has a simulation by a finite automaton, which is fast. 

In Section 5 we prove the necessary part of Theorem 2, and in Section 6 we 
describe a very simple algorithm that gives a good approximate simulation 
for the function 2p (the error decreases exponentially in the number of steps). 
In Section 7 we give a simple proof of the fact that any continuous function 
bounded away from and 1 has a simulation. Finally, in Section 8 we mention 
some open problems. 

2. Simulation as an approximation problem. In this section we show that 
a function / can be simulated if and only if it can be approximated by cer- 
tain polynomials, both from below and from above, and the approximations 
converge to /. Furthermore, the speed of convergence of the approximations 
determines the speed of the simulation (i.e., the distribution of the number 
of coin tosses needed). 

Let P p be the law of an infinite sequence X = (Xi, X2, . . .) of i.i.d. coin 
tosses with probability of heads p. By a slight abuse of notation, we also 



Table 1 



Simulation type 




Function class 


Ref. 


Terminating a.s. 




/ continuous 


[8] 


With finite expectation 




/ Lipshitz 


Proposition 23 


With finite kth moment 




fec k 


Proposition 22 


(and uniform tails) 








Fast (with exponential tails) 




f real analytic 


Theorem 2 


Via pushdown automaton 




/ algebraic over Q 


[10] 


Via finite automaton 




/ rational over Q 


[10] 






and /((0,1))C(0,1) 





4 



§. NACU AND Y. PERES 



denote by P p the induced law of the first n tosses X%, . . . ,X n , so for A C 
{0,1}", P p (A) = P p ((X 1 ,...,X n )eA). 

Fix n and consider the first n tosses. Either the algorithm terminates after 
at most n inputs (and in that case, it outputs a 1 or a 0), or it needs more 
than n inputs. Let A n C {0, l} n be the set of inputs where the algorithm 
terminates and outputs 1, and let B n be the set of inputs where either the 
algorithm terminates and outputs 1, or needs more than n inputs. Then 
clearly 

Pp(^4n) < P p (algorithm outputs 1) < ~P p (B n ). 

The middle term is f(p). Any sequence in {0, l} n has probability p k (l — 
p) n ~ k , where k is the number of l's in the sequence, so the lower and upper 
bounds are polynomials of the form J2k c kP k (l — p) n ~ k , with c& nonnegative 
integers. The probability that the algorithm needs more than n inputs is 
Pp(-Bn) — P p (A n ), so if the polynomials are good approximations for /, 
then the number of inputs needed has small tails. 

It is less obvious that a converse also holds: given a function / and a 
sequence of approximating polynomials with certain properties, there exists 
an algorithm which generates /, so that the probabilities of A n and B n as 
defined above are given by the approximating polynomials. We prove this 
in the rest of this section. 

In order to state our result in a compact form, we introduce the following. 

Definition 2. Let q(x,y),r(x,y) be homogeneous polynomials of equal 
degree with real coefficients. If all coefficients of r — q are nonnegative, then 
we write q^r. If in addition q^r, then we write q <r. 

This defines a partial order on the set of homogeneous polynomials of two 
variables. If q X r, then clearly q(x, y) < r(x, y) for all x, y > 0. The converse 
does not hold; for example, xy < x 2 + y 2 for all x,y>0, but xy ^ x 2 + y 2 . 

Proposition 3. If there exists an algorithm which simulates a function 
f on a set S C (0, 1), then for all n > 1 there exist polynomials 

g n (x,y) = J2 (l)a{n,k)x k y n - k , h n (x,y) = J2 (l) b(n,k)x k y n - k 
k=0 ^ ' k=0 ^ ' 

with the following properties: 

(i) < a{n,k) < b(n,k) < 1. 

(ii) Q)a(n, k) and (^)b(n,k) are integers. 

(hi) lim n g n (p, 1 - p) = f(p) = lim n h n (p, 1 - p) for allpeS. 
(iv) For all m <n, we have (x + y) n ~ m gm(x, y) ^ g n (x, y) and h n (x, y) ^ 
(x + y) n - m h m (x,y). 
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Conversely, if there exist such polynomials g n (x , y) , h n (x , y) satisfying (i)- 
(iv), then there exists an algorithm which simulates f on S, such that the 
number N of inputs needed satisfies P P (N > n) = h n (p, 1 — p) — g n (p, 1 — p). 

Proof. Suppose an algorithm exists, consider its first n inputs, and 
define as above A n C {0, l} n to be the set of inputs where the algorithm 
outputs 1, and B n C {0, 1}™ to be the set where the algorithm outputs 1 or 
needs more than n inputs. We also partition A n = \JA n ^ and B n = \JB Ujk 
according to the number k of l's in each word. Then every element in A n ^ 
or B n ^ has probability p k (l — p) n ~ k , so if we define 

a(n,k) = |Ai,fc|/ , b(n, k) = \B n>k \j , 

then 

g n (p, l-p) = Pp(An), h n (p, l-p) = P p (B n ). 

Conditions (i) and (ii) are clearly satisfied, and (iii) also follows easily. As 
discussed above, we have g n (p, 1 — p) < f(p) < h n (p, 1 — p) and ~P p {N > n) = 
h n (p, 1 — p) — g n {Pi 1 — p)\ since the algorithm terminates almost surely, the 
difference must converge to 0. From the definition of A n and B n , it is clear 
that g n (p, is an increasing sequence, and h n (p, 1—p) is decreasing. 

Condition (iv) must hold because of the structure of the sets A n and B n . 
Indeed, let m<n and assume {X\, . . . , X m ) S A m . Then (X\, . . . , X n ) 6 A n , 
whatever values X m+ i, . . . ,X n take. To make this formal, for Ed {0, l} m 
define 

T m , n (E) ={(X 1 ,...,X n )e {0, l} n :(X 1 ,...,X m )e E}. 

That is, T m ^ n (E) is the set obtained by taking each element in E and 
adding at the end all possible combinations of n — m zeroes and ones. Parti- 
tion T m ^ n {E) = \JT% in (E), so that all words in T]^ n (E) have exactly k l's. 
We have T mtTl (A m ) C A n , so T*(A m ) C A n k , so 



\A n> k\ > |T^ jn (j4 m )| — [ k — i ) 1^' 



which is the same as 
(2) ( n )a(n,k)>Y: 



i=0 



k 

n \ , , w \ - I n — m\ I m 



m,i | j 



k J ' ^ V k — i J V i 
■ o 



aim, i 



this is equivalent to g n (x,y) y (x + y) m g m (x, y). A similar observation holds 
for the sets B n , and this completes the proof of (iv). 
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<= Given the numbers a(n,k),b(n,k) satisfying (i)-(iv), we shall define 
inductively sets A n = U^n.fc, B n = {JB n ,k with 

A ntk cB ntk , \A nik \ = (fy a(n,k), \B rhk \ = (™J b(n,k). 

We also want the extra property that if m < n, then T mjn (A m ) C A n and 
T m ,n(B m ) D B n . Then we can construct an algorithm simulating / as follows: 
at step n, output 1 if in A n , output if in B^, continue if in B n — A n . 

We define Ai t o = {0} if o(l,0) = 1, and otherwise. We define A^i = {1} 
if o(l, 1) = 1, and otherwise. Similarly for Bi o an d Since a(l,k) < 
6(1, k), we have Ai jk C Bi >k for A; = 0,1. Condition (iv) guarantees that if 

\A mtk \ = (™ > ) a(m,k) and \B„ hk \ = b(m,k) 
for all k, then 

(3) |T* )n (4n)| < a(n,k) < (fj b(n,k) < \T^ n (B m )\. 

Hence we can construct the sets A n ,B n from the sets A m ,B m as follows. 
We want to have 

( 4 ) T m,n( A m) C A n>k C B n>k C T^ n (B m ). 

In view of (3), this can be done by simply choosing any total ordering of 
the set of binary words of length n with k Vs. We build A nk by starting with 
T^ n (A m ) and then adding elements of n (B m ) in increasing order until we 
obtain the desired cardinality (?)a(n, k). Then we add (?)6(n, k) — (?)a(n, k) 
extra elements to obtain B n ^ k . Of course, A n = [J A n k and B n = (J B n k . It 
is immediate that the sets thus defined have the desired properties, so the 
induction step from m to n = m+ 1 works and the proof is complete. □ 

Remark A. Condition (iv) in Proposition 3 implies that the sequence 
(g n (pi 1 —p))n>i is increasing, and the sequence (h n (p, 1 —p)) n >i is decreas- 
ing ( just set x = p, y = 1 — p) . 

Remark B. It is enough to define the numbers a(n,k) and b(n,k) 
when n takes values along an increasing subsequence rii f oo. Indeed, as- 
sume (iv) holds for m = ni,n = ni+x. Then just like above, we can construct 
the sets A n ,B n from the sets A m ,B m so that (4) holds. Thus we can con- 
struct inductively the sets A ni , B ni . The algorithm is allowed to stop only 
at some if m < n < rij+i, it just continues. This amounts to defining 

A n = Trii,n 

(A ni ),B n = T nun (B ni ) for m < n < n i+ i. In terms of the polyno- 
mials, this means 

g n (x, y) = (x + y) n ~ ni g ni (x, y), h n (x, y) = (x + y) n ~ n 'h ni (a?, y) 
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for m <n< rij+i. This is the same as 

a(n, k) = (k/n)a(n — 1, k — 1) + (1 — k/n)a(n — 1, k), 

b(n, k) = (k/n)b(n - 1, k - 1) + (1 - k/n)b(n - 1, fc), 

for rii <n < n^+i and all < /c < n. In the next section we will use this for 
the subsequence of powers of 2, m = 2 l . Note that it is enough to check 
(iv) for m = rii,n = nj+i, because then the algorithm is well defined and 
(iv) must hold for all m,n. Similarly, it is enough to check (iii) for n = rii, 
because the sequences (g n (p, 1 — p))n>i arid (h n (p, 1 — p)) n >i are monotone. 

Remark C. Finally, condition (ii) in Proposition 3 is not essential. In- 
deed, suppose we find numbers a(n, k) and /3(n, k) satisfying all conditions 
in the proposition, except for (ii). Then if we define 



a(n, k) 



b(n, k) 



(3(n,k) 



a(n, k) 
(5) 

conditions (i) and (ii) are trivially satisfied, and (iv) is satisfied because, for 
arbitrary Xi nonnegative reals and Cj nonnegative integers, 

(6) \J2 C i X i\ l X i\ ' \Y1 CiXi ] - \ X i\ ' 

Finally, (iii) still holds for p ^ 0, 1 because the error introduced in g n and 
h n is at most J2k=o^P k (.^- ~ p) n ~ k , which is exponentially small. 

3. Simulating linear functions. Let e > 0, and let f(p) = (2p) A (1 — 2e). 
Since we are only interested in small e, we also assume e < 1/8. We will use 
Proposition 3 to construct an algorithm which simulates /. As explained 
in Remark B of the previous section, it is enough to define a(n, k) and 
b(n, k) when n is a power of 2. Then the compatibility equations in (iv) are 
equivalent to 

<7) 0( 2„, t) ( 2 ;)>|: [ ,(„, ! )(»)( )1 "_; 
(8) H2»,*)(\")<i>(M)(:)( fc :\ 

These can be nicely expressed in terms of the hypergeometric distribution. 

Definition 3. We say a random variable X has hypergeometric distri- 
bution H(2n,k,n) if 

w p (^K:) (*"*)/(*"' 



<s 
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We require < k < 2n. If we have an urn with 2n balls of which k are 
red, and we select a sample of n balls uniformly without replacement, then 
X is the number of red balls in the sample. 

In terms of the hyper geometric, the compatibility equations (7) and (8) 
become 



We will need some properties of this distribution. 

Lemma 4. If X has distribution H(2n,k,n), then: 

(i) E(X/n) = k/(2n). 

(ii) VarpT/n) = k(2n - k)/(A(2n - l)n 2 ) < l/(2n). 

(hi) lfa>0, thenP(\X/n-k/(2n)\>a)<2exp(-2a 2 n). 

Both (i) and (ii) are standard facts; (hi) is a standard large deviation 
estimate. For a proof, see, for example, [7]. 

Finally, we need a way to find good approximations for /. Proposition 3(iii) 
suggests we can use the Bernstein polynomials. We recall their definition and 
main property. See [12], Chapter 1.4 for more details. 

Definition 4. For any function / : [0, 1] — > R and any integer n > 0, the 
nth Bernstein polynomial of / is Q n {x) = J2k=o f / n) Q,) x k (I — x) n ~ k . 

Proposition 5. // / is continuous, then Q n {x) — > f{x) uniformly on 



If a function is linear on some interval, the Bernstein polynomials provide 
a very good approximation to it; this suggests we could use them to construct 
a fast algorithm for functions such as f(p) = (2p) A (1 — 2e). To prove that 
the compatibility equations (10), (11) hold, we will need the following. 

Lemma 6. Let X be hypergeometric with distribution H(2n,k,n) as de- 
fined in (9), and let f: [0, 1] — > R be any function with \ f\ < 1. Then: 



(i) If f is Lipschitz, with \f(x) — f(y)\ < C\x — y\, then \~Ef(X/n) — 
f(k/(2n))\<C/V2n~. 

(ii) /// is twice differentiable, with \f"\< C, then \ Bf(X/n)-f(k/(2n))\ < 
C/(4n). 

(hi) If f is linear on a neighborhood of k/(2n), so f{t) = Ct + D if 
\t-k/{2n)\ <a, then \E f {X / n) - f {k / {2n))\ < {2\C\ + 4) exp(-2a 2 n). 



(10) 
(11) 



a(2n,k) > Ea(n,X), 
b(2n,k) <Bb(n,X). 



[0,1]. 
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Proof. If (i) holds, then we get 

\Ef(X/n) - f(k/(2n))\ < E\f(X/n) - f(k/(2n))\ 

< CE\X/n-k/(2n)\ 

<C(E\X/n-k/{2n)\ 2 ) 1/2 

= CVar{X/n) 1/2 < C/V2n. 

If (ii) holds, then Taylor's expansion for / gives 

\f(X/n) - /(fc/(2n)) - (X/n - k/(2n))f'(k/(2n))\ 

<(l/2)(X/n-k/(2n)) 2 sup\f"\ 

and E(X/n - k/{2n))f'{k/(2n)) = 0, so 

\Ef(X/n)-f(k/(2n))\ 

= |E(/(X/n) - f(k/(2n)) - (X/n - k/(2n))f'{k/(2n)))\ 

< (C/2)E(X/n-k/(2n)) 2 

= (C/2)Var(X/n) < C/(4n). 

If (iii) holds, then let g(t) = f(t) -Ct-D. We have g = on [k/(2n) - 
a,k/(2n) + a] and \g(t) - g(s)\ < \f(t) - f(s)\ + \C\\t - s\ < 2 + \C\ Vt,sG 
[0,1]. Hence 

\Ef(X/n)-f(k/(2n))\ 

= \Eg(X/n)-g(k/(2n))\ 
<E\g(X/n)-g(k/{2n))\ 
= E\g(X/n)-g(k/(2n))\l 

\X/n-k/2n\>a 

< (2 + \C\)P(\X/n - k/(2n)\ > a) 

< 2(2 + |C|)exp(-2a 2 n). 
This completes the proof of the lemma. □ 

If we specialize the lemma to f(p) = (2p) A (1 — 2e), which is Lipschitz 
with C = 2 and also piecewise linear, we obtain: 

Proposition 7. Let f(p) = (2p) A (1 - 2e), where e < 1/2. For X sat- 
isfying (9), we have: 

(i) \Ef(X/n)-f(k/(2n))\<V2/^Vk,n, 

(ii) \Ef(X/n) -f(k/(2n))\ <8exp(-2e 2 n) if k/(2n) < 1/2 - 2e. 
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Now we are ready to construct the algorithm. We start by defining num- 
bers a(n,k), (3{n,k) which satisfy assumptions (i), (hi) and (iv) in Propo- 
sition 3 [but not (ii)]. First we prove the compatibility equations (10) and 

(11) : 

Lemma 8. Define 

(12) ct{n, k) = f(k/n) = {2k /n) A (1 - 2e). 
Then for X satisfying (9), a(2n, k) > Ea(n, X). 

Proof. This follows from Jensen's inequality, since / is concave. □ 

The upper bound is more complicated. We would like (3{n, k) to be close 
to a{n,k), so that the algorithm is fast. Ideally, the difference should be 
exponentially small. This cannot be done over the whole interval [0, 1], since 
the Bernstein polynomials do not approximate / well near 1/2 — e, where it 
is not linear. To account for this, we also need a term of order 1/y/n, to be 
added if k/n > 1/2 — 3e. Finally, to control the speed of the algorithm for 
small p, we also want (3{n, k) and a(n, k) to be in fact equal if k/n is small. 

To achieve this, consider the following auxiliary functions: 

ri(p) = C x {p - (1/2 - 3e)) + , r 2 {p) = C 2 (p - 1/9)+. 

The positive constants C\ and C 2 will be determined later. Both functions 
are constant, equal to zero for p below a certain threshold, and increase 
linearly above the threshold. They are continuous and convex. 

Lemma 9. Define 

(13) P{n,k) = f{k/n)+r 1 {k/n)V2/n~ + r 2 {k/n)exp{-2£ 2 n). 
If e< 1/8 and X satisfies (9), then (3{2n,k) < E(3{n,X) Mk,n. 

Proof. This amounts to proving 

f{k/{2n))-Ef{X/n) 

<Eri(X/n) v / 27n-ri(A;/(2n))/v / 2/(2n) 

+ Br 2 {X/n) exp(-2e 2 n) - r 2 {k/{2n)) exp(-4e 2 n). 

Since r\ and r 2 are convex, r\{k/ {2n)) < Eri{X/n) and r 2 {k/{2n)) < 
~Er 2 {X/n), so it is enough to show 

\f{k/{2n))-Bf{X/n)\ 

< n{k/{2n)){l - l/v / 2)v / 2/n 

+ r 2 {k/{2n)) exp(-2e 2 n)(l - exp(-2e 2 n)). 
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If k/2n < 1/8, then X/n < k/n < 1/4 < 1/2 - e, so f(X/n) = 2X/n for 
all values of X, so the left-hand side is in fact zero and the inequality holds. 

If 1/8 < k/{2n) < 1/2 — 2e, then we use the second part of Proposition 7 
(the large deviation result). Thus, it suffices to show that 

8 < r 2 (fc/(2ra))(l - exp(-2e 2 n)). 

But r 2 {k/(2n)) > C 2 (l/8 - 1/9) = C 2 /72, so it is enough to choose 

C 2 = 72(l-exp(-2e 2 ))" 1 . 

If k/2n > 1/2 — 2s, we use the first part of Proposition 7. It is enough 
then to show that 1 < r 1 (k/(2n))(l - l/\/2). But r 1 (k/(2n)) > C x e, so it is 
enough to choose C\ = e _1 (l — l/y^)" 1 . This completes the proof of the 
lemma. □ 



We can now restate and prove: 



Theorem 1. For e G (0, 1/8), the function f{p) = 2p A (1 - 2e) has a 
simulation on [0, 1], so that the number of inputs needed, N, satisfies ~P p (N > 
n ) < Cp n , for all n > 1 and p S [0, 1/2 — 4e] . The constants C and p depend 
on e but not on p, and p < 1. 



Proof. We use Proposition 3. First we prove that for a(n, k) and /3(n, k) 
defined in (12) and (13) and 



9n(x,y) = 

k=0 



a{n,k)x y 



k„n~k 



k=0 



h n( x ,y) = J2(^j (3(n, k)x k y 



„k„.n—k 



conditions (i) , (iii) and (iv) are satisfied for the subsequence nj = 2 l . We have 
already proven (iv), and as discussed in the previous section, this implies 
that g n (p, 1— p) is increasing and h n (p, 1—p) is decreasing. By Proposition 5, 
the Bernstein polynomials g n (p, 1 — p) converge to /. Clearly, h n (p, 1 — p) — 
g n {p, 1 — p) < sup fc (/3(n, k) — a(n, k)) — > as n — > oo, so h n (p, 1—p) also 
converges to / and we have proven (iii). Condition (i) clearly holds for n 
large enough. 

The remaining condition (ii) does not hold for a(n,k), (3{n,k), but as 
discussed in the previous section, we can get around this by defining 



(14) a(n,k) 



a(n, k) 



b(n, k) 



P(n,k) 



Note that for k/n < 1/9, we have a(n, k) = (3{n, k) = 2k/n so a(n, 
2(^Zi) is an integer, whence a(n, k) = b{n, k). 
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The sequences a(n, k), b{n, k) satisfy conditions (i)-(iv), and the tail prob- 
abilities ~P P (N > n) = h n (p, 1 — p) — g n (p, 1 — p) satisfy 

n / \ n 

P P (N>n)<Y / (P(n,k)-a(n,k))( n k )p k (l-pT- k + ]T 2p k (l-p) n - k 

k=0 ^ ' k=n/9 

(15) 

<^ £ (^)p fc (i-pr- fc +c 2 e- 2 ^+^. 

fc=n/2-3en ^ ' P 



The second term in (15) decays exponentially, and so does the third (we 
can use 4 • 2 -n / 9 as an upper bound). For the first term, ignore the square 
root factor and look at the sum; it is equal to P(Y/n > 1/2 — 3e), where 
Y has binomial (n,p) distribution. Since p < 1/2 — 4e, a standard large 
deviation estimate (see [7]) guarantees that the first term in (15) is bounded 
above by exp(— 2e 2 n), so it also decays exponentially in n. 

Thus we do have P p (iV > n) < Cp n if n is a power of 2. For general 
n, write 2 k < n < 2 k+1 . Then P p (iV > n) < P P (N > 2 k ) < Cp 2 " < C{p l / 2 ) n . 
The proof is complete. □ 



Remark. Most of the proof works for a general linear function f(p) = 
(ap) A (1 — as), for any a > 0. For integer a the whole proof works (with 
different constants). If a is not an integer, then the only problem comes 
from rounding the coefficients; the rounding error introduced is bounded by 
2~2k=oP k (^ ~ P) n ~ k i which still decays exponentially, but the rate of decay 
approaches 1 as p approaches 0. In the next section we deduce a slightly 
weaker version of the result for general a as a consequence of the case a = 2. 



Proposition 3 and Lemma 6 can also be used to obtain simulations for 
more general functions. The simulations are no longer guaranteed to be fast, 
but we do obtain some bounds for the tails of N: 



Proposition 10. Assume f satisfies e < f < 1 - e on (0, 1). Then: 

(i) // / is Lipschitz, then it can be simulated with ~P P (N > n) < D/^/n 
for some uniform D > 0. 

(ii) /// is twice differentiate , then it can be simulated with P p (N > n) < 
D/n for some uniform D > 0. 



Remark. Neither of these conditions guarantees that N has finite ex- 
pectation, though we do believe that this should be possible to achieve, at 
least for C 2 functions. 
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Proof of Proposition 10. As in the proof of Theorem 1, it is enough 
to define numbers a(n,k), (3(n,k) which satisfy assumptions (i), (iii) and 
(iv) in Proposition 3; assumption (ii) can then be achieved by rounding as 
described in Remark C. We set 

a(n, k) = f(k/n) - S n , (3(n, k) = f(k/n) + 5 n , 

with 5 n — > 0. Then (i) holds as soon as 5 n < e and (iii) holds because 
g n (p, l-p) = Q n (p) ~ S n , K(p, l—p) = Qn(p) + S n , where Q n are the Bern- 
stein polynomials. It remains to check (iv), and as in the proof of Theo- 
rem 1, it is enough to do it for m,n powers of 2, which amounts to checking 
that for hyper geometric X satisfying (9), we have a(2n,k) > Ea(n, X) and 
(3(2n, k) < E/3(ra, X). From Lemma 6, 

a(2n, k) - Bo(n, X) > 6 n - S 2n - C/V2n 

if / is Lipschitz with constant C, and 

a(2n, k) - Ea(n, X)>5 n - 5 2n - C/{An) 

if / is twice differentiable and \f"\ < C. The exact same inequalities hold for 
E/3(n,X) — (3(2n, k). Hence we can choose S n = (1 + \/2)C/y/n in the Lip- 
schitz case, and 5 n = C / (2n) in the twice differentiable case, and the proof is 
complete. 

□ 

4. Fast simulation for other functions. We start with some facts about 
random variables with exponential tails. 

Proposition 11. Let X > be a random variable. Then the following 
are equivalent: 

(i) There exist constants C > 0, p < 1 such that P(X > x) < Cp x Vx > 0. 

(ii) Eexp(tX) < oo for some t > 0. 

If these hold, we say X has exponential tails. 
Proof. Straightforward. □ 

Proposition 12. Let X{ > be i.i.d. with exponential tails, and let 
N > be an integer-valued random variable with exponential tails. Then 
Y = X\ + • • • + Xjsr has exponential tails. 

Proof. Take t > such that Eexp(tXi) < oo. Then we can find k > 
such that p = ~Eexp(t(X 1 - k)) < I. Let S n = £"=i X { . Then 

P(S N > kn) < P(JV > n) + P(S n > kn). 
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The first term on the right-hand side decreases exponentially fast. To 
evaluate the second term, we use a standard large deviation estimate, 

P(S n > kn) < exp(-tkn)Bexp(tS n ) = (Eexp(t(Xi - k))) n = p T \ 

so the second term also decreases exponentially fast and we are done. □ 

Remark. We do not assume that N is independent from the AYs. 

Proposition 13. Constant functions f(p) = c G [0, 1] have a fast sim- 
ulation on (0, 1). 

Proof. For f(p) = 1/2, we can use von Neumann's trick: toss coins in 
pairs, until we obtain 10 or 01; in the first case output 1, otherwise output 
(if we obtain 11 or 00, we toss again). We need 2iV tosses, where N 
has geometric distribution with parameter p 2 + (1 — p) 2 ; this clearly has 
exponential tails (unless p is or 1). 

For any other constant c, write it in base 2: c = X^nLi c n2 _n with c n G 
{0,1}, generate fair coins using von Neumann's trick, and toss them until 
we get a 1. Output cm, where M is the number of fair coin tosses. This 

scheme generates f(p) = c, and requires X\-\ + Xm p-coin tosses, where 

Xi is the number of p-coin tosses needed to generate the ith fair coin. All 
Xi have exponential tails and so does M, so Proposition 12 completes the 
proof. Note that the rate of decay of the tails depends on p but not on c; 
this will be used below. □ 

Proposition 14. Let S,T c [0, 1] . 

(i) If f,g have fast simulations on S, then the product f ■ g has a fast 
simulation on S. 

(ii) If f has a fast simulation on T and g has a fast simulation on S, 
where g(S) C T, then fog has a fast simulation on S. 

(iii) // /, g have fast simulations on S and f + g < 1 — e on S for some 
e > 0, then f + g has a fast simulation on S. 

(iv) If f,g have fast simulations on S and f — g > e on S for some e > 0, 
then f — g has a fast simulation on S. 

Proof, (i) Let Nf,N g be the number of inputs needed to simulate each 
function. We simulate / and g separately; if both algorithms output 1, we 
also output 1; otherwise, we output 0. This simulates / • g using Nj + N g 
inputs, which has exponential tails by Proposition 12. 

(ii) We simulate g using its algorithm, then feed the results to the algo- 
rithm for /. We need X\ + • • • + X^ f inputs, where Xi are i.i.d. with the 
same distribution as N g . This has exponential tails by Proposition 12. 
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(iii) We write / + g = ho ip, where h(p) = 2p and ij)(jp) = (f(p) + g(p))/2. 
We proved in the previous section that h has a fast simulation on [0, (1 — 
e)/2]. To simulate tp, we simulate / and g separately to obtain binary vari- 
ables Bj and B g , then toss a fair coin; if the coin is heads, we output Bf, 
otherwise we output B g . So ip can be simulated using Nf + N g + N inputs, 
where ./V is the number of inputs needed to simulate a fair coin. Hence ip 
also has a fast simulation, so (iii) follows from (ii). 

(iv) Clearly / has a (fast) simulation iff 1 — / has one, so we can look at 
1 — (/ — g) = (1 — /) + g < 1 — e. The conclusion then follows from (iii). □ 

Proposition 15. If a > 0, e > 0, the function f has a fast simulation 
on S, and af(p) < 1 — e on S, then a ■ f has a fast simulation on S. 

Proof. By Theorem 1, 2p has a fast simulation on [0, 1/2 — e). By the 
composition rule Proposition 14 (ii) , 2 n p has a fast simulation on [0, l/2 n — e). 
For general a > 0, find n with a < 2 n and write ap = 2 n {a/2 n )p. We know 
multiplication by 2 n has a fast simulation; so does multiplication by a/2 n , 
because constants smaller than 1 have a fast simulation. Hence their com- 
position ap has a fast simulation on [0, 1/a — e). We apply the composition 
rule Proposition 14(ii) again to complete the proof. □ 

Proposition 16. Let f(p) = Y^=o a nP n with a n > for all n. Let t G 
(0, 1] such that f(t) < 1. Then f has a fast simulation on [0,t — 2s], Ve > 0. 

Proof. Write 

n=0 

Since the terms ((t — £)/t) n (e/t) are the probabilities of a geometric distri- 
bution, we can generate an (e/t)f(p)-com as follows. First we obtain ./V with 
geometric distribution, so ~P p (N = n) = ((t — e)/t) n {e/t). Then we generate 
N i.i.d. p/(t — e)-coins (by Proposition 15, this can be done by a fast sim- 
ulation), and we generate one a^t -coin [since f{t) < l,a N t N < 1]. Finally, 
we multiply the N + 1 outputs as in Proposition 14(i). 

The number of coin tosses we need is X + Y\ + • • • + Y/v + Z, where X is 
the number of tosses required to obtain N, Y{ is the number of tosses re- 
quired to generate the ith p/(t — e)-coin, and Z is the number of tosses 
required to generate one (constant) ajyt N -coin. Yi have exponential tails by 
Proposition 15, and Z has exponential tails (whose rate of decay does not 
depend on the value of N) by Proposition 13. 

The way we obtain iV is we toss (t — e)/t-coins until we obtain a zero; 
hence X can itself be written as X = W\ + • • • + Wn, where Wi is the number 
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of tosses required to generate a constant (t — e)/i-coin. Hence by Proposi- 
tion 12, (e/t)f(p) has a fast simulation. 

Finally, / = (t/e)(e/t)f has a fast simulation by Proposition 15. □ 

Proposition 17. Let f(p) = J2'n=o a nP n have a series expansion with 
arbitrary coefficients a n € R and radius of convergence R > 0. Let e > and 
S C (0, 1) so that e < f < 1 — e on S, and sup S < R. Then f has a fast 
simulation on S. 

Proof. Separating the positive and negative coefficients, we can write 
f = g — h where g, h are analytic with radius of convergence at least R, 
and have nonnegative coefficients. They must also be bounded: g < M and 
h < M, with M = J2n=o \a n \(supS) n < oo. Then g/(2M), h/(2M) must have 
fast simulations on S by Proposition 16, so by Proposition 14, so does 
2M(g/(2M) h/(2M)). 

□ 

Proposition 18. If f, g have fast simulations on S, are both bounded 
on S, g > e on S, and f /g < 1 — e on S for some s > 0, then f / g has a fast 
simulation on S . 

Proof. Let M = sup#. Let C G (0, 1) and h(p) = C/(l-p) = Y,n=a c P n - 
By Proposition 16, this has a fast simulation on (0, 1 — C — e/(4M)). We can 
replace 1 — p with p by switching heads and tails; hence ip(p) = C/p has a 
fast simulation on (C + e/(4M), 1). Set C = e/(4M). Then tp has a fast sim- 
ulation on (e/(2M), 1) and so does g/{2M) £ (e/(2M), 1), so ip o g = e/(2g) 
has a fast simulation on S. So does the product / • (ip ° g) = (e/2)(/ /g), and 
by Proposition 15 so does f /g, since we know it is bounded above by 1 — e. 
□ 

Theorem 19. Let f be a real analytic function on a closed interval 
[a,b] C (0,1), so f is analytic on a domain D containing [a,b], and assume 
that f(x) S (0, 1) for all x 6 [a, b]. Then f has a fast simulation on [a,bj. 

Proof. If D is the open disk of radius 1 centered at the origin, then / 
has a series expansion with radius of convergence 1 and the result follows 
from Proposition 17. For a general D, the idea of the proof is to map one 
of its subdomains to the unit disk, using a map which has a fast simulation. 
See Figure 1. 

Using a standard compactness argument, it is easy to show we can find a 
domain E so that [a, b] C E C D and E is the intersection of two large open 
disks of equal radius. The centers of both disks are on the line Re(^) = (a + 
b)/2, located symmetrically above and below the real axis. The boundaries 



FAST SIMULATION OF NEW COINS FROM OLD 



17 



of the disks intersect on the real axis at the points a — t and b + 1 for some 

small t > 0. If we make the radius of the disks large enough, we may assume 

that the angle between the disks is ir/n for some large integer n. 

We shall use a Mobius map of the form (pz + q)/(rz + s) to map those 

disks into half-planes. Fix c > 0. The map 

c c 

( 16 ) 91 ( z ) = 1 7\ ~ TT7T\ — 1 7\ 

z — (a — t) (b + t) — (a — t) 

maps the boundaries of the disks into lines going through the origin, so it 
maps E to the domain between those two lines contained in the positive 
half-plane Re(z) > 0. The angle between the two lines is vr/n, so the map 
maps E to the positive half-plane. 

The map g 2 (z) = 1 — 2/(1 + z) maps the positive half-plane to the unit 
disk, so g2 o g\ maps E to the unit disk. Hence / o (gf) -1 o (<?2) _1 is real 
analytic on the unit disk (it is easy to check that the inverses of g\ and 52 
are analytic on their respective domains), so it has a fast simulation on any 
closed interval contained in (0, 1). It remains to check that 52 °5i maps [a, b] 
to such an interval, and that it has a fast simulation. Then it follows from 
Proposition 14(i) that / also has a fast simulation. 

For sufficiently large c, the function g\ maps the interval [a, b] to the 
interval \gi(b) , gi(a)] where 1 <g\(b). Hence l/gi maps [a, b] to some closed 
subinterval of (0, 1), and by Proposition 18 it has a fast simulation (as the 
ratio of two linear functions). Clearly, so does l/<?™. Finally, we can write 
92 °9i =93° (1/ffi )j where g 3 (z) =g 2 (l/z) = 1 - (2z)/(l + z) also has a fast 
simulation, by the same Proposition 18. This completes the proof. □ 



5. Necessary conditions for fast simulations. 



Proposition 20. Assume f has a fast simulation on an open set S C 
(0, 1). Then f is real analytic on S. 




D 

Fig. 1. The map gi. 
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Proof. Consider a fast algorithm, fix p and let f n {p) be the probability 
that it outputs 1 after exactly n steps. Then / = Y^Li fn and 

n oo 

o</(p) = E Mp)< c P n v ™>° 

i=l i=n+l 

for some constants C > 0,p < 1. Pick any B with 1 < B < 1/p. Since f n are 
polynomials, f n (z) is well defined for any complex z. We shall prove below 
that we can find e > so that for any complex z and positive integer n, 



(17) 



\fn{z)\ <B n f n {p) if|*- P |<£. 



Then for any m> n and z £ B(p, e) (the open ball with center p and radius 
e), we have 



E /« 

i=n+l 



< E \M z )\ < E Bi h(p) 

j=n+l i=n+l 
oo 

< B'Cp^ 1 = {Bp) n BC/{l- Bp). 

i=n+l 



Hence the sequence {J2?=i fi} is Cauchy on B(p,e), so it converges uni- 
formly on B(p,e) to a limit which is analytic by a standard theorem (see 
[1], page 176, Theorem 1). Hence / is real analytic. 

To prove (17), note that f n can be written as f n {z) = J2k=o a n,kZ k (l — 
z) n ~ k with a n> fc > 0. Since \z — p\ < e, we have \z\ < p + e and |1 — z\ < 
1 — p + e. Choose e so p + e < Bp and 1 — p + e < B(l — p). Then 

\z k {l - z) n ~ k \ <(p + e) k (l -p + e) n ~ k < B n p k (l - p) n ~ k 

and 



^2a n . k z k {\ 

k=0 



\n—k 



< £ a nM \z k {l - z) n ~ k \ < B n «n.,fcP fc (l - P) 
k=0 k=0 



n—k 



as desired. □ 



Proposition 21. Assume S C [0, 1] is closed and f has a fast simula- 
tion on S. Then the number of inputs N has uniformly bounded tails: there 
exist constants C,p which do not depend onp, so P p (iV > n) < Cp n , \/p £ S. 

Proof. Let g n (p) = ~P P (N > n). Just as in Proposition 20, g n can be 
written as g n (z) = Y^k=o a n.kZ k {\ — z) n ~ k with a n k > 0, so for any p £ (0, 1) 
and B > 1 we can find e > so 

(18) \g n (z)\<B n g n (p) if|z-p|<e. 
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For any p £ S H (0,1) we have g n (jp) < C p Pp for some C p > 0, p p < 1. Set- 

— 1/2 

ting B = p p in (18), we obtain that there exists e p > so 
SnO) < C p pp /2 \i ze(p-Sp,p + Ep). 

The intervals (p — e p ,p + e p ) cover 5. Since <S is closed, it is compact, so 
we can find a finite subcover (p, — e Pi ,pi + e Pi ), l<i< N. Then we can set 

1 /2 

C = max C Pi , p = max p p ! . . □ 

Remark. This also shows that if a function has a simulation on some 
S C (0, 1), then the set of p where the simulation is fast is open in S. 

Proposition 22. Assume f has a simulation on an open set S C (0, 1), 
such that the number of inputs needed N has finite kth moment on S, and 
furthermore the tails of the moments decrease uniformly: lim^^oo E p N k t(N > 
n) = uniformly in p £ S. Then f £ C k (S) (i.e., f has k continuous deriva- 
tives on S). 



PROOF. Let /„ be defined as in Proposition 20. Since f — Xm=i fn> ^ 
is enough to prove that the series J2^=i fn^ converges uniformly on S. We 
shall prove that \fn \ < Cn k f n for a uniform constant C. Then 

oo oo 

E \fn ] \ < E Cnk fn = CB p N k l(N > m - 1) 

n=m n=m 

converges to zero uniformly as m — > oo, so the series is Cauchy and we are 
done. To prove the required inequality, recall that f n (p) = Yl?=o a n,iP l (^ — 
p) n ~ % with a n ,i > 0. Write = i(i — 1) • • • (i — j + 1). From Leibniz's formula 
for the derivative of a product, 



|(p*(l-p)«-i)( fc )| 



k 

E( - W) ( ^(i-p) n - i ) (fc - J) 



j=0 

k 



j=0 KJ J 
k 

< E(&O n V(l -p) n "Vmin(p, 1 
j=0 

<CnV(l-p) n " ! 

for C = k(k\)/ infggB min^, 1 — where the inf is taken over some small 
neighborhood B of p. It follows that |/n^| < Cn k f n on S\ □ 
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Proposition 23. Assume f has a simulation on a closed interval I C 
(0, 1), such that the number of inputs needed N has sup pg / E p (iV) < oo. Then 
f is Lipschitz over I. 

PROOF. We are given that B p N = Y^=i n fn <C <oo. Since I is closed, 
/ C (e, 1 — e) for some e. As in the previous proposition, we obtain \ f' n \ < 
nf n / min(e, 1 — e). Hence | Ya=i fl\ — C/ min(e, 1 — e) so 

n n 

"/i(p)-E/*(9) 



< \p — q\C I min(e, 1 — e) 
Letting n — > oo completes the proof. □ 



/ j % \y ) / 
i=l i=l 



6. An approximate algorithm for doubling. The methods described in 
the previous sections are essentially constructive. Proposition 3 gives a recipe 
for constructing an algorithm, given an approximation; all that is needed is 
an ordering of all binary words of length n with k Vs. 

In the particular case of the function f(p) = 2p, there exists an extremely 
simple algorithm. It also works for any p £ (0, 1/2); there is no need to bound 
the function away from 1. The catch is that it is approximate: it outputs 1 
with probability very close to 2p, with the error decaying exponentially in the 
number of steps. This must be, of course; the Keane-O'Brien results show 
that we could not have an exact algorithm with these properties. However, 
in practice, an approximate result may suffice. 

Proposition 24. Letp< 1/2 and consider an asymmetric simple ran- 
dom walk S n = X\ + • ■ • + X n , with P p (Xi = 1) = p = 1 — P p (Aj = — 1). 
Let A n be the event that max(Si, . . . , S n ) > 0. Then P p (A n ) = J2k=o(2k/nA 
— p) n ~ k = Qn{p), where Q n is the nth Bernstein polynomial of the 
function f(p)=2pAl. 

Proof. We need to show that the number of paths with k positive 
steps among the first n steps, and max(Si, . . . , S n ) > 0, is (2k/n A 1)()!)- 
For k > n/2, this is obvious. For k < n/2, (2fc/n)(£) = 2(^"|) and the result 
follows from the reflection principle (see, e.g., [3], page 197). □ 

Since / is piecewise linear, its Bernstein polynomials converge to it expo- 
nentially fast (except at p = 1/2), so we obtain the following. 

Algorithm. Run an asymmetric simple random walk S n = X\ + ■ • ■ + 
X n , with P p (Aj = 1) = p = 1 — P p (Aj = — 1) for at most n steps. If the walk 
ever reaches nonnegative territory (S& > for some 1 < k < n), output 1. 
Otherwise, stop after n steps, output 0. 
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A standard large deviation estimate (see [7]) shows that if p < 1/2, the 
probability of outputting 1 is 2p — e, where < e < 2exp(— 2n(l/2 — p) 2 ). 
See [5] for another construction of an approximate doubling algorithm. 

7. Continuous functions revisited. In this section we use Proposition 3 
to simulate any continuous function / that satisfies e< / < 1 - e on (0,1) 
for some e > 0. Our proof is simpler than the original proof of [8]. However, 
their argument is more general since it does not assume that / is bounded 
away from and 1. We will use the following theorem of Polya: 

Theorem 25. Let q(x,y) be a homogeneous polynomial with real co- 
efficients satisfying q(x,y) > 0, Vx > 0,y > 0. Then for some nonnegative 
integer n, all coefficients of (x + y) n q(x, y) are nonnegative. 

See [6], pages 57-59, for a proof. This clarifies the connection between the 
partial order ^ in Definition 2 and the pointwise partial order. It says that 
if q(x, y) < r(x, y) for all x, y > 0, then (x + y) n q(x, y) -< (x + y) n r(x, y) for 
some n. 

Theorem 26 ([8]). Let e > and suppose that f : (0,1) h-> [e, 1 - e] is 
continuous. Then f admits a terminating simulation. 

Proof. Let i satisfy 2~ l < e/4. By Proposition 5, we can approximate 
/ — 3 • 2~ l by a Bernstein polynomial q mi of sufficiently high degree rrii with 
error smaller than 2~ l . More precisely, 

mi , s 

q mi (x,y) = E (7 J W*/"*) " 3 • 2 ~>V ni ~ fe 

will satisfy f(p) - 4 • 2~ l < q m .(p, 1 - p) < f(p) - 2 ■ 2~ l for all p G (0, 1). 
The sequence q mi (p, 1 — p) is increasing in i, so 

q mi (x,y)(x + y) mt+1 ~ mi <q mi+1 (x,y) Vx, y >o. 

By Theorem 25, 

q mi (x, y){x + y^+i-^+s, qmi+i ( Xj y ^ x + 

for some integer Sj > 0. Thus if we define n\ = m\ and more generally, 
Hi = mi + (si + • • • + Si-i), then the homogeneous polynomials 

9n t (x,y) = q mi (x, y) (x + y) n ^ 

satisfy conditions (i), (iii) and (iv) in Proposition 3 along the subsequence 
{ni}. Condition (ii) is easily obtained by the rounding process described 
in Remark C. By Remark B, once we have g n for the subsequence n = nj, 
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we can define it for all n. A similar construction can be used to define 
approximations from above h n . (In fact, these approximations will require 
another sequence {s^} analogous to {si} above, and for consistency we need 
to use max{sj,s^} in both approximations.) Hence by Proposition 3, / has 
a terminating simulation algorithm. □ 

8. Open problems. Theorem 2 does not settle the issue of what happens 
near and 1, or on the boundary of the domain of analyticity of a function. 
An interesting example is the square root function f(p) = yjp. Our methods 
provide fast simulations on any interval (e, 1], but if p is allowed to take 
any value in (0, 1), the best result we are aware of is the one in [10], where 
the authors construct a simulation using a random walk on a ladder graph. 
Estimates for the tails of the number of inputs needed N are then given 
by return probabilities for a simple random walk, so ~P P (N > n) decays like 
n -1 / 2 . We do not know whether one can do better. 

Question 1 . Is there an algorithm that simulates ^fp on (0,1), for which 
the number of inputs needed has finite expectation for all pi 

Remark. Entropy considerations (see [2], page 43) imply that if an 
algorithm as in Question 1 exists, then the expectation of the number of 
inputs cannot be uniformly bounded on (0, 1). Indeed, this expectation must 
be at least H(^/p)/ H(p), where H(p) = — p\og(p) — (1 — p)log(l — p) is the 
entropy function. 

Question 2. Let J C (0, 1) be a closed interval and let /:Jh> (0, 1) be 
continuous. Suppose that we have a simulation algorithm that takes as input 
a sequence {Xi} of i.i.d. p-coins and produces a sequence of i.i.d. /(p)-coins. 
The rate of the algorithm (when it exists) is defined to be the limit as n — ► oo 
of 1/n times the expected number of f(p) coins produced from the first n 
inputs. The rate can never exceed the entropy ratio H(p)/ H(f(p)); see [2]. 
Given J and /, are there simulation algorithms with rates arbitrarily close 
to the entropy ratio, uniformly for all p£ J? 

A positive answer is known for constant /: for f(p) = 1/2 variants of the 
von Neumann scheme (see [4, 11]) will do, and other constants follow from 
combining these with [9]. However, for nonconstant / [except the identity 
and f(p) = 1 — p] the situation is unclear; a good example to ponder is 
f(p)=P 2 - 

We would also like to know whether Proposition 22 can be improved. 

Question 3. Is it true (possibly subject to some technical conditions) 
that a function has a simulation where the number of inputs has uniformly 
bounded kth moment, if and only if it has k continuous derivatives? 
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